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(54) TiUe: A METHOD OF DETECTING BIOLOGICALLY ACTTIVE SUBSTANCES 



(57) Abstract 

The present invention relates to a method of detecting a biologically active substance affecting intracellular processes, the method 
comprising: (a) culturing a cell containing a DNA sequence coding for (i) a green fluorescent protein (GFP) having a protein kinase 
recognition site, or (ii) a green fluorescent protein wherein one or more amino acids have been substituted, inserted or deleted to provide 
a binding domain of a second messenger or an enzyme recognition site, or (iii) a hybrid polypeptide of green fluorescent protein (GFP) 
or a modified GFP and an enzyme recognition site or a binding domain for a second messenger under conditions permitting expression of 
the DNA sequence; (b) measuring the fluorescence of the cell; (c) incubating the cell with a sample suspected of containing a biologically 
active substance affecting intracellular processes; and (d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), such change being indicative of the presence of a biologically 
active substance in said sample. 



BNSOOaO: <WO_9S2388aA1JU> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


ArmenU 


GB 


United Kingdom 


MW 


Malawi 


AT 


AUMTU 


GE 


Georgia 


MX 


Meaico 


AU 


Auscnlia 


GN 


Gtiinea 


NE 


Niger 


BB 


Baibadoft 


GR 


Greece 


NL 


Nethcrlaruls 


BE 


Betghim 


HU 


Hungary 


NO 


Norway 


BF 


BuTtina Ftso 


IE 


Ireland 


NZ 


New Zealand 


BG 


Bulgaria 


IT 


haly 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


FT 


Portugal 


BR 


Brmzil 


K£ 


Kenya 


RO 


Rocnania 


BY 


Belarus 


KG 


Kyigystan 


RU 


Russian Federation 


CA 


Canada 


KP 


Deniocraiic People's Republic 


SD 


Sudan 


CF 


Centra] African Republic 




of Korea 


SE 


Sweden 


CG 


Congo 


KR 


Republic of Korea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


Si 


Slovenia 


a 


Ctue d'lvoire 


U 


Liechtenstein 


SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Lanlca 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


cs 


CzechcMlovakia 


LT 


Lithuania 


TD 


Chad 


cz 


Czech Republic 


LU 


Luxembourg 


TG 


Togo 


DE 


Gennany 


LV 


Larvia 


TJ 


Tajikistan 


DK 


Denmark 


MC 


Monaco 


TT 


Trinidad and Tobago 


EE 


Estonia 


MD 


Republic of Moldova 


UA 


Ukraine 


ES 


Spain 


MG 


Madagascar 


UG 


Uganda 


n 


Finland 


ML 


Mali 


US 


United States of America 


FR 


Franoe 


MN 


Mongolia 


uz 


Uzbekistan 


GA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



BNSOOCID: <WO_9e2388aA1JL> 



wo 96/23898 



PCT/DK96/00052 



1 

A METHOD OF DETECTING BIOLOGICALLY ACTIVE SUBSTANCES 
FIELD OF INVENTION 

The present invention relates to a method of detecting biologically active substances 
affecting intracellular processes, and a DNA construct and a cell for use in the method. 

5 BACKGROUND OF THE INVENTION 

Second messengers and protein kinases play key roles in the signalling pathways that 
control the response of mammalian cells (and probably all eukaryotic cells) to most 
stimuli. Although such signalling pathways have been subjected to extensive studies, 
detailed knowledge on e.g. the exact timing and spatial characteristics of signalling 
10 events is often difficult to obtain due to lack of a convenient technology. There is, 
however, one exception to this rule: our understanding of the role of Ca^* in e.g. 
intracellular signalling has been greatly improved due to the development of the 
fluorescent Ca** probe FURA-2 that permits real times studies of Ca^* in single living 
cells. 

15 Moreover, the construction of probes for cAMP (Adams et al, Nature 349 (1991), 694- 
697) and activity of the cAMP-dependent protein kinase (Sala-Newby and Campbell, 
FEBS 307(2) (1992), 241-244) has been attempted. The protein kinase A probe, 
however, suffers from the drawback that it is based on the firefly luciferase and 
accordingly produces too little light for fast single cell measurements. The cAMP probe 

20 on the other hand has to be microinjected and is therefore not well suited for routine 
laboratory work. In conclusion, both probes lack some of the elegant properties that 
resulted in the widespread use of FURA-2. 

Recently it was discovered that Green Fluorescent Protein (GFP) expressed in many 
different cell types, including mammalian cells, became highly fluorescent (Chalfie et 
25 aL, Science 263 (1994), 802-805). WO/07463 describes a cell capable of expressing GFP 
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and a method for selecting cells expressing a protein of interest and GFP based on 
detection of GFP-fluorescence in the cells. 



SUMMARY OF THE INVENTION 

The purpose of the present invention is to provide a method of detecting a biologically 
5 active substance affecting intracellular processes based on the use of green fluorescent 
protein, including wild-type GFP derived from the jelly fish Aequorea victoria and 
modifications of GFP, such as modifications that changes the spectral properties of the 
GFP fluorescence, for the construction of probes, preferably real time probes for 
second messengers and protein kineise activity, 

10 In one aspect, the present invention relates to a DNA construct comprising a DNA 
sequence coding for 

(i) green fluorescent protein (GFP) wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an en2yme recognition site, or 

15 (ii) a hybrid p>olypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site. 



In another aspect, the present invention relates to a cell containing a DNA sequence 
coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
20 substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 
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(ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site, 

and capable of expressing said DNA sequence. 

In a further aspect, the present invention relates to a method of detecting a biologically 
5 active substance affecting intracellular processes, the method comprising 

(a) culturing a cell containing a DNA sequence coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 

10 (ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site 

under conditions permitting expression of the DNA sequence, r 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample suspected of containing a biologically active 
15 substance affecting intracellular processes, and 



(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), such 
change being indicative of the presence of a biologically active substance in said sample. 

In a still further aspect, the present invention relates to a method of characterizing the 
20 biological activity of a substance with biological activity, the method comprising 
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(a) culturing a cell containing a DNA sequence coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 

5 (ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site 

under conditions permitting expression of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample of a biologically active substance affecting 
10 intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), said 
change being characteristic of the biological activity of the biologically active substance 
in said sample. 

15 Furthermore, studies on the substrate specificity of the different protein kinase A 
(PKA) isoforms using synthetic peptides have shown that peptides containing the motifs 
RRXSX or RXKRXXSX (S being the phosphorylated amino acid) tend to be the best 
substrates for PKA, and a review by Zetterquist, 0. et al. (in Kemp, B.E. (ed.) Peptide 
and Protein Phosphorylation (1990), 172-188, CRC Press, Boca Raton, Florida, USA) 

20 confirms that most known substrates of PKA contain said motifs. 

Available amino acid sequences of GFP do not suggest that GFP is a PKA substrate 
because of a lack of recognition sites comprising the motifs RRXSX or RXKRX^X. 
It is therefore surprising that a native or wild-type green fluorescent protein (GFP) 
derived from the jellyfish Aequorea victoria can be phosphorylated by protein kinase A 
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and thereby the spectral properties of GFP are changed resulting in a substantial 
increase of fluorescence. 



In a preferred aspect, the present invention relates to a method of detecting a 
biologically active substance affecting intracellular processes, the method comprising 

5 (a) culturing a cell containing a DNA sequence coding for a wild-type green fluorescent 
protein having a protein kinase recognition site under conditions permitting expression 
of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample suspected of containing a biologically active 
10 substance affecting intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), such 
change being indicative of the presence of a biologically active substance in said sample. 

In a further preferred aspect, the present invention relates to a method of 
15 characterizing the biological activity of a substance with biological activity, the method 
comprising 

(a) culturing a cell containing a DNA sequence coding for a wild-type green fluorescent 
protein having a protein kinase recognition site, under conditions permitting expression 
of the DNA sequence, 

20 (b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample of a biologically active substance affecting 
intracellular processes, and 
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(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), said 
change being characteristic of the biological activity of the biologically active substance 
in said sample. 

5 In a still further preferred aspect the present invention relates to a DNA construct 
comprising the DNA sequence shown in Fig. 4a coding for a wild-type GFP and a 
transformed cell containing said DNA construct and capable of expressing said DNA 
sequence. The transformed cell is preferably a mammalian cell. A microorganism, E. 
coli NN049087, carrying the DNA sequence shown in Fig. 4a has been deposited for 
10 the purpose of patent procedure according to the Budapest Treaty in Deutsche 
Sanmilung von Mikroorganismen und Zellkulturen GmbH, Mascheroderweg 1 b, D- 
38124 Braunschweig, Federal Republic of Germany, under the deposition No. DSM 
10260, 

In the present context, the term "green fluorescent protein" is intended to indicate a 
15 protein which, when expressed by a cell, emits fluorescence (cf. Chalfie et al., Science 
263, 1994, pp. 802-805). In the following, GFP in which one or more amino acids have 
been substituted, inserted or deleted is most often termed "modified GFP", 

The term "second messenger" is used to indicate a low molecular weight component 
involved in the early events of intracellular signal transduction pathways. 

20 The term "binding domain of a second messenger" is used to indicate a segment of a 
protein which, in the course of intracellular metabolic processes, binds the secondary 
messenger. 

The term "enzyme recognition site" is intended to indicate a peptide sequence 
covalently modified by an enzyme (e.g. phosphorylated, glycosylated or cleaved), 
25 preferably the enzyme recognition site is a protein kinase recognition site, which is 
intended to indicate a peptide sequence covalently modified by a kinase, i.e. 
phosphorylated. 
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It should be emphasized that phosphorylation of a protein at a protein kinase 
recognition site often is followed (or accompanied) by dephosphorylation of said 
protein* A GFP based probe for activity of given protein kinase(s) would 
therefore also provide information on the activity of relevant protein 
5 phosphatases since the parameter monitored is the net phosphorylation of the GFP 
based probe. 

The term "hybrid polypeptide" is intended to indicate a polypeptide which is a fusion 
of at least a portion of each of two proteins, in this case at least a portion of the green 
fluorescent protein and at least a. portion of a binding domain of a second messenger 
10 or an enzyme recognition site. 

In the present context, the term "biologically active substance" is intended to indicate 
a substance which has a biological function or exerts a biological effect in the human 
or animal body. The sample may be a sample of a biological material such as a 
microbial extract, or it may be a sample containing a compound or mixture of 
15 compounds prepared by organic synthesis. 

The phrase "any change in fluorescence" means any change in absorption properties, 
such as wavelength and intensity, or any change in spectral properties of the emitted 
light, such as a change of wavelength, fluorescence lifetime, intensity or polarisation. 

The mechanism(s) behind a change in e.g. the fluorescence intensity of a 
20 modified GFP upon phosphorylation could be several. As one possibility, 

phosphorylation of said GFP variant could change the chromophore environment, 
either due to proximity of the added phosphate group or to phosphorylation 
induced conformational changes. Correspondingly, binding of e.g. a second 
messenger to the binding domain of a some GFP variant or GFP fusion protein 
25 could induce conformational changes that ultimately changes the chromophore 
envirormient and thereby the fluorescence. As support for these suggestions, it 
has been shown that amino acid substitutions distant to the chromophore (e.g. 
amino acids 167, 202, 203 and 222) can change the fluorescence intensity and 
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spectral characteristics of GFP (Ehrig et al. (1995) FEBS Letters 367:163; Heim 
et al. (1994) Proc. Natl. Acad. Sci. 91:12501). 

The development of luminescent probes according to the present invention allows real 
S time studies of second messengers and specific enzymes such as protein kinases in 
single living cells, thereby making it possible to study the precise timing and the spatial 
characteristics of these factors. Moreover, studies on heterogeneity in cell populations 
are made possible. 

Due to the strong fluorescence of GFP, the luminescence of cells expressing the probes 
10 may easily be detected and analyzed by employing a combination of fluorescence 
microscopy and image analysis. Furthermore, it should be emphasized that the probes 
described are easy to introduce into cells, as they can be expressed in the cells of 
interest after transfection with a suitable expression vector. 

Real time recombinant probes for second messengers and enzyme activity, such as 
15 kinase activity, are not only useful in basic research but also in screening programmes 
aiming at identifying novel biologically active substances. Many currently used screening 
programmes designed to find compounds that affect cAMP concentration and protein 
kinase activity are based on receptor binding and/or reporter gene expression. The 
recombinant probes described herein, on the other hand, make it possible to develop 
20 an entirely new type of screening assays able to monitor immediate and transient 
changes of cAMP concentration and protein kinase activity in intact living cells. 

Any novel feature or combination of features described herein is considered essential 
to this invention. 

DETAILED DESCRIPTION OF THE INVENTION 

25 In a preferred embodiment of the present invention, the gene encoding GFP is derived 
from the jellyfish Aequorea victoria. The sequence of this gene is described in Prasher 
et al,, Gene 111. 1992, pp, 229-233 (GenBank Accession No. M62653). The gene may 
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be modified so as to code for a variant GFP in which one or more amino acid residues 
have been substituted, inserted or deleted to provide a binding domain of a second 
messenger or an enzyme recognition site. According to this embodiment, it is preferred 
to insert a DNA sequence coding for an enzyme recognition site into the gene coding 
5 for GFP, for instance at one of the following positions: between amino acid 39 and 40, 
between amino acid 71 and 72, between amino acid 79 and 80, between amino acid 107 
and 108, between amino acid 129 and 130, between amino acid 164 and 165, or 
between amino acid 214 and 215. Points of insertion may be selected on the basis of 
surface probability (which may be calculated using the GCG software package which 

10 employs a formula of Emini et al., J. Virol 55(3), 1985, pp. 836-839). When the enzyme 
is protein kinase C, the recognition site inserted should preferably contain the motif 
XRXXSXRX, S being the phosphorylated amino acid. In successful constructs of this 
type, phosphorylation of the modified GFP may result in detectably altered optical 
proterties of GFP. It should be noted that extensive deletion may result in loss of the 

15 fluorescent properties of GFP. It has been shown, that only one residue can be 
sacrificed from the amino terminus and less than 10 or 15 from the carboxyl terminus 
before fluorescence is lost, cf. Cubitt et al. TIBS Vol. 20 (11), pp. 448-456, November 
1995. Thus, according to this invention the modification of the GFP gene so as to code 
for a variant GFP in which one or more amino acid residues have been substituted, 

20 inserted or deleted is limited to modifications resulting in a variant GFP having 
fluorescence properties. 

The binding domain of a second messenger may be a receptor of a second messenger. 
The second messenger may be cyclic AMP, inositol phosphate 3, cyclic GMP, cyclic 
ADP or diacylglycerol. The binding domain is preferably the cyclic AMP receptor 
25 (CRP, e.g. as described in Weber and Steitz, J, Mol. Biol. 198, 1987, pp. 311-326; 
Schroeder and Dobrogosz, J. Bacteriol. 167, 1986, pp. 612-622) or a part thereof 
capable of binding cyclic AMP. 

Native CRP has two distinct domains: an N-terminal cAMP binding domain as well as 
a C-terminal DNA binding activity (Weber and Steitz, J. Mol. Biol. 198 (1987), 311- 
30 326). Upon binding of cAMP to the N-terminal portion of CRP a conformational 
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change is induced in the C-terminus, which allows the binding of CRP to the promoters 
of certain genes. In the successful fusions of CRP (or a portion thereof) to GFP (or a 
portion thereof), cAMP induced conformotional changes in CRP are transmitted to 
GFP, thereby changing the optical properties of GFP. 

5 In a preferred embodiment of the present invention, the gene or cDNA sequence 
encoding a wild-type GFP is derived from the jellyfish Aequorea victoria. A preferred 
sequence of this gene is disclosed by Fig. 4a herein. Fig. 4a shows the nucleotide 
sequence of a wild-type GFP (Hind3-EcoRl fragment) and Fig, 4b shows the amino 
acid sequence, wherein start codon ATG corresponds to position 8 and stop codon 

10 TAA corresponds to position 722 in the nucleotide sequence of Fig. 4a. Another 
sequence of an isotype of this gene is disclosed by Prasher et al.. Gene 111 , 1992, pp. 
229-233 (GenBank Accession No. M62653). Any gene that codes for a fluorescent 
protein, such as wild-type GFP, having a protein kinase recognition site, and derived 
from any organism expressing a green fluorescent protein or similar fluorescent, 

15 phosphorescent or luminescent protein may be used in this invention. 

The enzyme recognition site or protein kinase recognition site is preferably a SeryThr 
or Tyr protein kinase, such as protein kinase C or a protein kinase A recognition site 
(both are reviewed in e.g, B,E, Kemp and R.B. Pearson; TIBS 15, Sept. 1990, pp. 342- 
346), or the insulin receptor or the Src kinase or a portion thereof containing a motif 
20 required £ls a substrate for protein kinase, as suggested above. Kinase catalysed 
phosphorylation may result in detectably altered optical properties of GFP, 

The DNA sequence encoding GFP, the binding domain of a second messenger or the 
enzyme recognition site may suitably be of genomic or cDNA origin, for instance 
obtained by preparing a suitable genomic or cDNA library and screening for DNA 
25 sequences coding for all or part of any of these proteins by hybridization using synthetic 
oligonucleotide probes in accordance with standard techniques (cf, Sambrook et al., 
supra V 
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The DNA construct of the invention encoding the wild-type GFP, modified GFP or 
hybrid polypeptide may also be prepared synthetically by established standard methods, 
e.g. the phosphoamidite method described by Beaucage and Caruthers, Tetrahedron 
Letters 22 (1981), 1859 - 1869, or the method described by Matthes et aL, EMBO 
5 Journal 3 (1984), 801 - 805. According to the phosphoamidite method, oligonucleotides 
are synthesized, e.g. in an automatic DNA synthesizer, purified, annealed, ligated and 
cloned in suitable vectors. For most purposes, it may be practical to prepare a shorter 
DNA sequence such as the DNA sequence coding for the enzyme recognition site 
synthetically, while the DNA coding for GFP or the binding domain of a second 
10 messenger will typically be isolated by screening of a DNA library. 

Furthermore, the DNA construct may be of mixed synthetic and genomic, mixed 
synthetic and cDNA or mixed genomic and cDNA origin prepared by ligating fragments 
of synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding 
to various parts of the entire DNA construct, in accordance with standard techniques 
15 (cf. Sambrook et al., Molecular Cloning: A Laboratory Manual 1989, Cold Spring 
Harbor Laboratory, New York, USA). 

The DNA construct may also be prepared by polymerase chain reaction (PCR) using 
specific^ primers, for instance as described in US 4,683,202 or Saiki et al., Science 239 
(1988), 487 - 491. A more recent review of PCR methods may be found in PCR 
20 Protocols, 1990, Academic Press, San Diego, California, USA. 

The DNA sequence coding for GFP may also be modified by other means such as by 
conventional chemical mutagenesis or by insertion, deletion or substitution of one or 
more nucleotides in the sequence, either as random or site-directed mutagenesis. It is 
expected that such mutants will exhibit altered optical properties or altered heat 
25 stability. 

The DNA construct of the invention may be inserted into a recombinant vector which 
may be any vector which may conveniently be subjected to recombinant DNA 
procedures. The choice of vector will often depend on the host cell into which it is to 
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be introduced. Thus, the vector may be an autonomously replicating vector, i.e. a vector 
which exists as an extrachromosomal entity, the replication of which is independent of 
chromosomal replication, e.g. a plasmid. Alternatively, the vector may be one which, 
when introduced into a host cell, is integrated into the host cell genome and replicated 
5 together with the chromosome(s) into which it has been integrated. 

The vector is preferably an expression vector in which the DNA sequence encoding 
wild-type GFP, the modified GFP or the hybrid polypeptide is operably linked to 
additional segments required for transcription of the DNA. In general, the expression 
vector is derived from plasmid or viral DNA, or may contain elements of both. The 
10 term, "operably linked" indicates that the segments are arranged so that they function 
in concert for their intended purposes, e.g. transcription initiates in a promoter and 
proceeds through the DNA sequence coding for the modified GFP or hybrid 
polypeptide 

The promoter may be any DNA sequence which shows transcriptional activity in the 
15 host cell of choice and may be derived from genes encoding proteins either homologous 
or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of the DNA sequence 
encoding wild-type GFP, the modified GFP or hybrid polypeptide in manunalian cells 
are the SV40 promoter (Subramani et al., Mol. Cell Biol. 1 fl981\ 854 -864), the MT-1 
20 (metallothionein gene) promoter (Palmiter et al., Science 222 (1983), 809 - 814) or the 
adenovirus 2 major late promoter. 

An example of a suitable promoter for use in insect cells is the polyhedrin promoter 
(US 4,745,051; Vasuvedan et al., FEES Lett. 31h (1992) 7 - 11), the PIO promoter 
(J.M. Vlak et al., J. Gen. Virology 69, 1988, pp. 765-776), the Autographa califomica 
25 polyhedrosis virus basic protein promoter (EP 397 485), the baculovirus immediate 
early gene 1 promoter (US 5,155,037; US 5,162,222), or the baculovirus 39K delayed- 
early gene promoter (US 5,155,037; US 5,162,222). 
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Examples of suitable promoters for use in yeast host cells include promoters from yeast 
glycolytic genes (Hitzeman et aL, J. Biol. Chem. 255 (1980), 12073 - 12080; Alber and 
Kawasaki, J- Mol. Appl. Gen. 1 (1982), 419 - 434) or alcohol dehydrogenase genes 
(Young et al., in Genetic Engineering of Microorg anisms for Chemicals (Hollaender 
5 et al, eds.), Plenum Press, New York, 1982), or the TPU (US 4,599,311) or ADH2-4c 
(Russell et al.. Nature 304 (1983), 652 - 654) promoters. 

Examples of suitable promoters for use in filamentous fungus host cells are, for 
instance, the ADH3 promoter (McKnight et al., The EMBO J. 4 (1985), 2093 - 2099) 
or the lEiA promoter. Examples of other useful promoters are those derived from the 
10 gene encoding A. oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, A. 
niger neutral o-amylase, A. niger acid stable a-amylase, A. niger or A. awamori glu- 
coamylase (gluA), Rhizomucor miehei lipase, A. oryzae alkaline protease. A: oryzae 
triose phosphate isomerase or A. nidulans acetamidase. Preferred are the TAKA- 
amylase and gluA promoters. 

15 Examples of suitable promoters for use in bacterial host cells include the promoter of 
the Bacillus stearothermophilus maltogenic amylase gene, the Bacillus licheniformis 
alpha-amylase gene, the Bacillus amyloUquefacietis BAN amylase gene, the Bacillus 
subtilis alkaline protease gen, or the Bacillus pumilus xylosidase gene, or by the phage 
Lambda Pr or Pl promoters or the E. coli lac, tre or tac promoters. 

20 The DNA sequence encoding wild-type GFP, the modified GFP or hybrid polypeptide 
of the invention may also, if necessary, be operably connected to a suitable terminator, 
such as the human growth hormone terminator (Palmiter et al., oe^ ck.) or (for fungal 
hosts) the TPIl (Alber and Kawasaki, og, dt) or ADH3 (McKnight et al., cit,) 
terminators. The vector may further comprise elements such as polyadenylation signals 

25 (e.g. from SV40 or the adenovirus 5 Elb region), transcriptional enhancer sequences 
(e.g. the SV40 enhancer) and translational enhancer sequences (e.g. the ones encoding 
adenovirus VA RNAs). 
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The recombinant vector may further comprise a DNA sequence enabling the vector to 
replicate in the host cell in question. An example of such a sequence (when the host 
cell is a mammalian cell) is the SV40 origin of replication. 

When the host cell is a yeast cell, suitable sequences enabling the vector to replicate 
5 are the yeast plasmid 2^ replication genes REP 1-3 and origin of replication. 

The vector may also comprise a selectable marker, e.g. a gene the product of which 
complements a defect in the host cell, such as the gene coding for dihydrofolate 
reductase (DHFR) or the Schizosaccharomyces pombe TPI gene (described by P.R. 
Russell, Gene 40, 1985, pp. 125-130), or one which confers resistance to a drug, e.g. 
10 ampiciliin, kanamycin, tetracyclin, chloramphenicol, neomycin or hygromycin. For 
filamentous fungi, selectable markers include amdS. pvrG. argB. niaP. sC. 

The procedures used to ligate the DNA sequences coding for wild-type GFP, the 
modified GFP or hybrid polypeptide, the promoter and optionally the terminator and/or 
secretory signal sequence, respectively, and to insert them into suitable vectors 
15 containing the information necessary for replication, are well known to persons skilled 
in the art (cf., for instance, Sambrook et al., op.cit. l. 

The host cell into which the DNA construct or the recombinant vector of the invention 
is introduced may be any cell which is capable of expressing the present DNA construct 
and includes bacteria, yeast, fungi and higher eukaryotic cells, such as mammalian cells. 

20 Examples of bacterial host cells which, on cultivation, are capable of expressing the 
DNA construct of the invention are grampositive bacteria such as strains of Bacillus, 
such as strains of A subtiUs, B. licheniformis, B. lentus, B. brevis, B. siearothermophilus, 
B. alkalophUus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus.B. megatherium 
or B, thuringiensiSy or strains of Streptomyces, such as 5. Uvidans or 5. murinuSy or 

25 gramnegative bacteria such as Echerichia coli. The transformation of the bacteria may 
be effected by protoplast transformation or by using competent cells in a manner 
known per se (cf. Sambrook et ah, supra V 
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Examples of suitable mammalian cell lines are the HEK293 and the HeLa cell lines, 
primary cells, and the COS (e^g. ATCC CRL 1650), BHK (e.g. ATCC GRL 1632, 
ATCC CCL 10), CHL (e.g. ATCC CCL39) or CHO (e.g. ATCC CCL 61) cell lines. 
Methods of transfecting mammalian cells and expressing DNA sequences introduced 
5 in the cells are described in e.g. Kaufman and Sharp, J. Mol. Biol. 159 (1982), 601 - 
621; Southern and Berg, J. Mol. AdpL Genet. 1 (1982), 327 - 341; Loyter et al., Proc. 
Natl. Acad. Sci. USA 79 (1982), 422 - 426; Wigler et al.. Cell li (1978), 725; Corsaro 
and Pearson, Somatic Cell Genetics 7 (1981), 603, Graham and van der Eb, Virology 
52 (1973), 456; and Neumann et al., EMBO J. 1 (1982), 841 - 845. 

10 Examples of suitable yeasts cells include cells of Saccharomyces spp. or 
Schizosaccharomyces spp., in particular strains of Saccharomyces cerevisiae or 
Saccharomyces kluyveri. Methods for transforming yeast cells with heterologous DNA 
and producing heterologous polypeptides therefrom are described, e.g. in US 4,599,311, 
US 4,931,373, US 4,870,008, 5,037,743, and US 4,845,075, all of which are hereby 

15 incorporated by reference. Transformed cells are selected by a phenotype determined 
by a selectable marker, commonly drug resistance or the ability to grow in the absence 
of a particular nutrient, e.g. leucine. A preferred vector for use in yeast is the- POTl 
vector disclosed in US 4,931,373. The DNA sequence encoding the modified GFP or 
hybrid polypeptide may be preceded by a signal sequence and optionally a leader 

20 sequence , e.g. as described above. Further examples of suitable yeast cells are strains 
of Kluyveromyces^ such as IC lactis, Hansenula, e.g. K pofymorpha, or Pichia, e.g. P. 
pastoris (cf. Gleeson et aL. J, Gen, Microbiol. 132. 1986, pp. 3459-3465; US 4,882,279). 

Examples of other fungal cells are cells of filamentous fungi, e.g. Aspergillus spp., 
Neurospora spp., Fusarium spp. or Trichoderma spp., in particular strains olA, oryzae, 
25 A. nidulans or A. nigen The use of Aspergillus spp. for the expression of proteins is 
described in, e.g., EP 272 277, EP 230 023, EP 184 438. 

When a filamentous fungus is used as the host cell, it may be transformed with the 
DNA construct of the invention, conveniently by integrating the DNA construct in the 
host chromosome to obtain a recombinant host cell. This integration is generally con- 
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sidered to be an advantage as the DNA sequence is more likely to be stably maintained 
in the cell. Integration of the DNA constructs into the host chromosome may be per- 
formed according to conventional methods, e.g. by homologous or heterologous 
recombination. 

5 Transformation of insect cells and production of heterologous polypeptides therein may 
be performed as described in US 4,745,051; US 4,879,236; US 5,155,037; 5,162,222; EP 
397,485) all of which are incorporated herein by reference. The insect cell line used as 
the host may suitably be a Lepidoptera cell line, such as Spodoptera fmgiperda cells or 
Trichoplusia ni cells (cf. US 5,077,214). Culture conditions may suitably be as described 
10 in, for instance, WO 89/01029 or WO 89/01028, or any of the aforementioned 
references. 

The transformed or transfected host cell described above is then cultured in a suitable 
nutrient medium under conditions permitting the expression of the present DNA 
construct after which the cells may be used in the screening method of the invention. 
15 Alternatively, the cells may be disrupted after which cell extracts and/or supematants 
may be analysed for fluorescence. 

The medium used to culture the cells may be any conventional medium suitable for 
growing the host cells, such as minimal or complex media containing appropriate 
supplements. Suitable media are available from commercial suppliers or may be 
20 prepared according to published recipes (e.g. in catalogues of the American Type 
Culture Collection). 

In the method of the invention, the fluorescence of cells transformed or transfected 
with the DNA construct of the invention may suitably be measured in a spectrometer 
where the spectral properties of the cells in liquid culture may be determined as scans 
25 of light excitation and emission. Alternatively, such cells grown on nitrocellulose filters 
placed on plates containing solid media may be illuminated with a scanning 
polychromatic light source and imaged with an integrating colour camera. The colour 
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of the emitted light may then be determined by image analysis using specialised 
software. 

The invention is further illustrated in the following examples with reference to the 
appended drawings, wherein 

5 Fig. 1 shows a map of the pUC19-GFP plasmid construction. GFP nucleotide numbers 
referred to below with a "G" are from the GenBank GFP sequence record (accession 
No. M62653). Bases in italics represent GFP sequence. The pUC19 nucleotide numbers 
referred to below with a "P" are from the GenBank pUC19 sequence record (accession 
No. X02514). Bases in plain text represent pUCl9 sequence. Bases in bold represent 
10 non-GFP non-pUC19 sequence, which have been inserted by PGR for the introduction 
convenient restriction sites. 

Fig. 2 shows maps of the four basic GFP-CRP fusion constructs: ^ 

A) Full length GFP at the N-terminal fused with full length CRP at the C-terminal. 

B) Truncated GFP at the N-terminal fused with full length CRP at the C-terminal. 
15 C) Full length CRP at the N-terminal fused with full length GFP at the C-terminal. 

D) Truncated CRP at the N-terminal fused with full length GFP at the C-terminal, 
corresponding to the construct where the DNA binding domain of CRP has been 
replaced with GFP. 

GFP nucleotide numbers referred to below with a "G" are from the GenBank GFP 
20 sequence record (accession No. M62653). CRP nucleotide numbers referred to below 
with a "C are from the GenBank CRP sequence record (accession No. M 13770). The 
pUC19 nucleotide numbers referred to below with a "P" are from the GenBank pUC19 
sequence record (accession No. X02514). 

Example 1 

25 Cloning of cDNA encoding the green fluorescent protein 
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Briefly, total RNA, isolated from>l. victoria by a standard procedure (Sambrook et al., 
Molecular Cloning. 2., eds. (1989) (Cold Spring Harbor Laboratory Press: Cold Spring 
Harbor, New York), 7.19-7.22) was converted into cDNA by using the AMV reverse 
transcriptase (Promega, Madison, WI, USA) as recommended by the manufacturer. 
5 The cDNA was then PCR amplified, using PCR primers designed on the basis of a 
previously published GFP sequence (Prasher et al., Gene 111 (1992), 229-233; 
GenBank accession No. M62653) together with the UlTma™ polymerase (Perkin 
Elmer, Foster City, CA, USA). The sequences of the primers were: GFP2: 
TGG AATAAGCTTTATG AGTAAAGG AG AAG AACTTTT and GFP- 1 : 

10 AAGAATTCGGATCCCTTTAGTGTCAATTGGAAGTCr 

Restriction endonuclease sites inserted in the 5* (a Hindlll site) and 3* (EcoRI and 
BamHI sites) primers facilitated the cloning of the PCR amplified GFP cDNA into a 
slightly modified pUC19 vector. The details of the construction are as follows: LacZ 
Shine-Dalgamo AGGA, immediately followed by the 5' Hindlll site plus an extra T and 

15 the GFP ATG codon, giving the following DNA sequence at the lacZ-promoter GFP 
fusion point: /\5£Z-AGGAAAGCTTT ATG -GFP. At the 3' end of the GFP cDNA, the 
base pair corresponding to nucleotide 770 in the published GFP sequence (GenBank 
accession No. M62653) was fused to the EcoRI site of the pUCl9 multiple cloning site 
(MCS) through a PCR generated BamHI, EcoRI linker region. 

20 Example 2 

Isolation of mutant GFPs 

A variant of GFP with altered optical properties and/or heat stability is prepared by 
subjecting the GFP described in Example 1 to a round of chemical mutagenesis 
followed by screening potential mutants for altered properties. 

25 In brief, the GFP-encoding DNA sequence described in Example 1 (the Hindlll-EcoRI 
fragment) is heat-denatured and subjected to chemical mutagens essentially as 
described by Myer et al., Science 229, 1985, p. 242. The mutagen is either nitrous acid, 
or permanganate or formic acid. The resulting mutated population of single stranded 
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GFP fragments are either amplified by PCR using the primers described in Example 
1, or reverse transcribed by AMV reverse transcriptase as described in Example 1 prior 
to amplification by PCR. The PCR products are cleaved by restriction enzymes Hindlll 
and EcoRI and the products of this reaction are ligated into the modified pUC19 
5 plasmid described in Example L 

The ligation reaction is transformed into an E. coli strain and plated on LB agar plates 
containing 100 Mg/nil ampicillin to give approximately 500 transformants per plate. The 
fluorescence of GFP in the cells is detected by exciting the plates with a light source 
at 398 nm or 365 nm. Replicas of colonies are made onto fresh plates or plates on 

10 which a nitrocellulose filter has been placed prior to replication. When colonies have 
formed once more, they are individually collected and resuspended in water. The cell 
suspensions are placed in a LS50B Luminescence Spectrometer (Perkin Elmer Ltd., 
Beaconsfield, Buckinghamshire, England) equipped with a temperature-controlled cuvet 
holder, and the spectral properties (scans of both light excitation and emission) are 

15 determined. Alternatively, whole plates with approximately 500 transformants are 
illuminated with a scanning polychromatic light source (fast monochromator from 
TLL-L. Photonics, Munich, Germany) and imaged with an integrating RGB colour 
camera (Photonic Science Color Cool View). The actual colour of the emitted light was 
determined by image analysis using the Spec R4 software (Signal Analytics Corporation, 

20 Vienna, VA, USA). 

Heat sensitivity of the mutated GFP is tested by characterizing its spectral properties, 
as described above, after a sequential rise of the temperature from 20*C to 80'C. 

In another round of mutagenesis, E.coli cells containing the GFP pUCl9 plasmid 
described in Example 1, are subjected to treatment with N-methyl-N-nitro-N- 
25 nitrosoguanidine at a concentration of 25 milligrams per liter for 18 hours, and the cells 
are plated and analyzed as described above. Alternatively, plasmids are first recovered 
from the treated cells and transformed into E.coli and plated and analyzed as described 
above. 
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Example 3 

Construction of a GFP-based recombinant cAMP probe 

The basis of the GFP-based recombinant cAMP probe described herein is the fusion 
of a portion of the cAMP receptor protein (CRP) from E, coli to GFP. 

5 It was decided to prepare 4 basic GFP-CRP fusion constructs, from which a whole 
array of semi-random fusion constructs may be generated, some of which are expected 
to have the ability to induce conformational changes in GFP when cAMP is bound to 
the N-terminal portion of CRP resulting in detectable changes in the optical properties 
of GFP. 

10 7. Description of the four basic GFP-CRP fusions 

The plasmid harbouring the GFP-CRP fusion shown in figure 2 A) was constructed the 
following way: The CRP insert of plasmid pHA7 (Aiba et al., Nucl. acids Res. 10 (1982) 
1345-1377) was PCR amplified with the PCR primers 

CRPl (CGATACAGATCTAAGCTTTATGGTGCTTGGCAAACCGC) and 
15 CRP-2 (CGGAATTCTTAAAAGCTTAGATCTTTACCGTGTGCGGAGATCAG) 
followed by digestion with the restriction endonucleases Bglll and EcoRI. The GFP 
insert of plasmid pUCl9-GFP (see Example 1) was PCR amplified using the PCR 
primers GFP2 (see Example 1) and 

GFP-4 (GAATCGTAGATCTTTGTATAGTTCATCCATGCCATG) followed by 
20 digestion with the restriction endonucleases Hindlll and Bglll. Subsequently, in a three- 
part ligation, the Bglll/EcoRI fragment of the PCR amplified CRP DNA and the 
Hindlll/Bglll fragment of the PCR amplified GFP DNA was ligated with a 
Hindlll/EcoRI vector fi^agment of the slightly modified pUCl9 plasmid described in 
Example 1, followed by transformation of £. coli. 
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The plasmid harbouring the GFP-CRP fusion shown in figure 2 B) was constructed 
essentially as described above for the figure 2 A) plasmid with a single modification: 
The PCR primer 

GFP-3 (GAATCGTAG ATCTTTG ACTTCAGCACGTGTCTTGTA) was used instead 
5 of the GFP-4 PCR primer. 

The plasmid harbouring the CRP-GFP fusion shown in figure 2 C) was made by PCR 
amplification of the CRP insert of plasmid pHA7 with PCR primers CRPl and CRP-2, 
followed by digestion with restriction endonuclease Hindlll and ligation into the Hindlll 
site of plasmid pUC19-GFP (see Example 1), 

10 The plasmid harbouring the GFP-CRP fusion shown in figure 2 D) was constructed 
essentially as described above for the figure 2 C) plasmid with a single modification: 
The PCR primer 

GFP-1 (CCAGTTAAGCTTAGATCTTCCGGGTGAGTCATAGCGTCTGG) was used 
instead of the CRP-2 PCR primer, 

15 2 Generation of semirandom GFP-CRP fusions 

The 4 basic GFP-CRP fusion plasmids described above are digested with the restriction 
endonuclease Bglll (opening the plasmids at GFP-CRP fusion points), followed by 
treatment with the double stranded exonuclease Bal31 for 1 minute, 2 minutes, 3 
minutes etc. up to 20 minutes (cf. Sambrook et al, op. cit. at 15.24). Subsequently, the 
20 Bal31 treated DNA is incubated with the T4 DNA polymerase (cf. Sambrook et ah, og. 
cit. at 15,24) to generate blunt ends, followed by self ligation (essentially as described 
by Sambrook et al., op. cit. at 1.68). Finally, the self ligated Bal31 treated plasmid DNA 
is transformed into E. colL 

3a. Screening of the CRP-GFP fusiom for cAMP induced changes in fluorescence 

25 E. coli transformants expressing one of the four basic CRP-GFP fusions or one of the 
semirandom GFP-CRP fusions are grown overnight in 2 ml Luria-Bertani medium with 
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added ampicilin (100 ^g/ml). The cells are then pelleted by centrifugation followed by 
resuspension in 0.5 ml lysis buffer (100 mM NaCl, 1 mM EDTA and 50 mM Tris pH 
8.0). Subsequently, 25 /il 10 mg/ml Lysozyme is added to the resuspended cells, 
followed by incubation for 10 min. at room temperature, vigorous vortexing and 
5 centrifugation for 5 min. at 20 000 x g. Finally, emission and excitation spectra for the 
resulting protein extracts (the supematants) are acquired by using the LS50B 
Luminescence Spectrometer and the FL Data Manager software package (both from 
Perkin Elmer Ltd., Beaconsfield, Buckinghamshire, England). The spectra recorded 
before as well as after the addition of cAMP to a final concentration of 0.5 mM are 
10 compared by using the Graph Builder software package (Perkin Elmer). The CRP-GFP 
fusions exhibiting cAMP induced changes in fluorescence are investigated further by 
expression in mammalian cells. 

3b. (alternative protocol) Screening of the CRP-GFP fusions for cAMP induced changes 
in fluorescence. 

15 Cyclic AMP levels in E.coli cells vary according to the carbon source provided; see e.g. 
Epstein et.al. (1975), Proc.Natl.Acad.Sci.USA 72, pp. 2300-2304, and Botsford and 
Harman (1992), Microbiological Reviews 56, p. 100-122. For example, cells grown on 
glucose contain a lower level of cAMP than cells grown on e.gi glycerol. Furthermore, 
shifting cells from one carbon source to another, or adding glucose to a culture grown 

20 on a non-glucose carbon source change the cAMP level of the cells. Hence, the cAMP- 
induced change in the fluorescence of the CRP-GFP fusions may be determined by 
continuously measuring the fluorescence of cells expressing the fusions, after transfer 
from medium containing e.g. glycerol as carbon to medium containing 0,2% glucose. 
The cells are analyzed in liquid culture in the LS50B Luminescence Spectrometer or 

25 by growing them on nitrocellulose filters placed on plates with solid media; the filter 
is transferred from plates with one type of medium to plates with another type of 
medium, and the fluorescence is continuously monitored by exciting the plates with a 
scarming polychromatic light source (fast monochromator from T.I.L.L. Photonics, 
Munnich, Germany) and collecting colour images with an integrating RGB color 

30 camera (Photonic Science Color Cool View). The actual colour of the emitted light is 
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determined by image analysis using the Spec R4 software package (Signal Analytics 
Corporation, Vienna, VA, USA). 

Example 4 

Construction of a GFP based recombinant probe for protein kinase activity 

5 Description of the GFP based recombinant protein kinase C (PKC) substrates 

Studies on the substrate specificity of the different PKC isoforms using synthetic 
peptides have shown that peptides containing the motif XRXXSXRX (S being the 
phosphorylated amino acid) tend to be the best substrates for PKC (as reviewed in 
Kemp, B, E, and Pearson, R. B. (1990) TIBS 15 Sept^ 342-346). Moreover, the 

10 naturally occurring neuronal PKC substrate GAP-43 has the following amino acid^ 
sequence around the phosphorylated serine residue (underlined): AATKIQASFRGHIT 
(Kosik, K.S et al. (1988) Neuron 1, 127-132). On the basis of these data we have 
selected the putative PKC recognition motif RQASFRS for insertion in GFP at various 
positions. Insertion points were selected on the basis of surface probability (calculated 

15 using the GCG software package, which employs a formula of Emini et aL(1985) J. 
ViroL, 55(3), 836-839 ), slightly modified for the end values of the protein chains. The 
single probabaities are taken from Janin et al. (1978) J. MoL Biol. 125, 357-386) and/or 
vicinity of the GFP chromophore. The heptapeptide is inserted in GFP by PCR at the 
following positions: 

20 Between amino acid (aa) 39 and aa 40 (PCR primers PKC-1: 
GATACCAAAGATCTGAAAGAAGCTTGTCGGTATGTTGCATCACCTTCACC 

and 

PKCl: GATACCAAAGATCTGGAAAACTTACCCTTAAATTT), 

between aa 52 and aa 53 (PCR primers PKC-2: 
25 GATACCAAAGATCrGAAAGAAGCTTGTCGTTTTCCAGTAGTGCAAATAAA 

and 

PKC2: GATACCAAAGATCTCTACCTGTTCCATGGCCAACAC), 
between aa 71 and aa 72 (PCR primers PKC-3: 
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GATACCAAAGATCTGAAAGAAGCTTGTCGAAAGCATTGAACACCATAAGA 
and 

PKC3: GATACCAAAGATCTTCAAGATACCCAGATCATATG), 
between aa 79 and 80 (PGR primers PKC-4: 

5 GATACXAAAGATCTGAAAGAAGCITGTCGTTTCATATGATCrGGGTATCTand 
PKC4: GATACCAAAGATCTCAGCATGACTmrCAAGAGT), 
between aa 107 and 108 (PGR primers PKC-5: 

GATACCAAAGATCTGAAAGAAGCTTGTCGCTTGTAGTTCCCGTCATCrTTand 
PKC5: GATACCAAAGATCTACACGTGCTGAAGTCAAGTTT), 
10 between aa 129 and 130 (PGR primers PKC-6: 

GATAarAAAGATCTGAAAGAAGCrrGTCGATCAATACCTTTTAACrCGATand 
PKC6: GATACCAAAGATCTTTTAAAGAAGATGGAAACATT), 
between aa 164 and 165 (PGR primers PKC-7: 

GATACCAAAGATCTGAAAGAAGCTTGTCGGTTAACrTTGATTCCATTCTTand 
15 PKC7: GATACCAAAGATCTTTCAAAATTAGACACAACATT) 
and between aa 214 and 215 (PGR primers PKC-8: 

GATACCAAAGATCrGAAAGAAGCTTGTCGCTTTTCGTrGGGATCrTTCGAand 

PKC8: GATACCAAAGATCTAGAGACCACATGGTCCTTCTT). 

The PGR primers were designed in the following way: Reverse primers: 5'- 

20 GATACCAA AGA TCT GAA AGA AGC TTG TCG-3' + 21 nucleotides of the 
antisense strand (upstream of the second aa mentioned) and forward primers: 5'- 
GATACCAA AGA TCT-3' + 21 nucleotides of the sense strand (downstream of the 
first aa), each PGR primer being provided with a unique Bglll site (giving rise to the 
arginine and serine residues of the heptapeptide). The PKC site is inserted by PGR of 

25 pUC19-GFP plasmid DNA (see Example 1) with the 8 forward primers and the 8 
matching reverse primers, followed by digestion with Bglll, self-ligation and 
transformation of E. coli (cf. Sambrook ct al., op. cit. l 

Z Screening of he CFP based recombinant PKC substrates for phosphorylation induced 
changes in fluorescence 
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E. coli transformants expressing one of the eight GFP based recombinant PKC 
substrates are grown overnight in 2 ml Luria-Bertani medium with added ampicilin (100 
fig/ml). The cells are then pelleted by centrifugation followed by resuspensioh in 0.5 ml 
lysis buffer (100 mM NaCl, 1 mM EDTA and 50 mM Tris pH 8.0). Subsequently, 25 
5 /xl 10 mg/ml Lysozyme is added to the resuspended cells, followed by incubation for 10 
min. at room temperature, vigorous vortexing and centrifugation for 5 min. at 20 000 
X g; Finally, emission and excitation spectra for the resulting protein extracts (the 
supematants) are acquired by using the LS50B Luminescence Spectrometer and the FL 
Data Manager software package (Perkin Elmer). The spectra recorded before as well 
10 as after treatment of the extracts with purified PKC (Promega, Madison, WI, USA) 
according to the manufacturers instruction, are compared by using the Graph Builder 
software package (Perkin Elmer). The GFP based recombinant PKC substrates 
exhibiting phosphorylation induced changes in fluorescence are investigated further by 
expression in mammalian cells. 

15 Example 5 

>' 

Characterization of the recombinant fusion probes in mammalian cells. 

The CRP-GFP fusions (Example 3) exhibiting cAMP-induced changes in fluorescence 
as well as the GFP -based recombinant PKC substrates exhibiting phosphorylation- 
induced changes in fluorescence are investigated further by expression in mammalian 
20 cells. 

Inserts of the respective plasmids are isolated by digestion with the restriction 
endonucleases Hindlll and BamHI and ligated into the Hindlll and BamHI sites of the 
MCS of the mammalian pREP4 vector (Invitrogen, San Diego, California, USA). 
Subsequently, Baby Hamster Kidney (BHK) are transfected with the resulting plasmid 
25 constructs according to the standard calcium phosphate-DNA precipitate protocol (cf. 
Sambrook et al,, op. cit. at 16.33-16.35). Stable transfectants with high expression of the 
recombinant probes are identified and cloned after 6-14 days in culture by quantifying 
the fluorescence in an image analysis system, which consists of a Nikon Diaphot 200 
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microscope with a temperature controlled stage, a slow scan CCD camera (T.I.L.L. 
Photonics), a polychromatic light source (T,LL,L. Photonics), and a PC based image 
analysis software package (FUCAL from TJ.L.L. Photonics). Alternatively, the 
fluorescence prop>erties are monitored in a photometer based system. In this system the 
5 CCD camera is replaced by a photomultiplier D104 (PTI, Canada). 

The clones are cultured for a couple of days in glass coverslip chambers (NUNC, 
Copenhagen, Denmark) before image analysis. 

The ability of the clones to detect changes in cAMP is characterized by elevating 
intracellular cAMP level by challenging the cells with forskolin (0.1-10/iM) or dibutyryl- 

10 cAMP (1-100 fxM) and monitoring the associated change of spectral properties. 
Similarly, clones that are sensitive to variations in PKC activity are characterized by 
activating PKC in them with PMA (phorbol 12-myristate 13-acetate) (10-1000 nM) or 
OAG (l-oleoyl-2-acetyl-sn-glycerol) (1-100 /iM). The stimulant-induced changes of 
fluorescence properties are monitored continuesly using above mentioned imaging 

15 system. Combining imaging with photometry makes it possible to characterize the 
response of the recombinant probes in both high spatial and high temporal resolution. 

Example 6 

GFP as a recombinant probe for protein kinase activity 
Purification of GFP from E. coli cells expressing GFP 

20 E. coli cells containing a plasmid allowing expression of GFP were grown overnight at 
24°C. Cells were pelleted, the supernatant was discarded, and the pellet was 
resuspended in 1/20 of the original volume in lOOmM Na-phosphate buffer (pH 8.0). 
Cells were disrupted by sonication, and cell debris were pelleted at 12000g for 20 
minutes. The supernatant was recovered, ammonium sulphate was added to a final 

25 concentration of 1.5M, and the resulting solution was subjected to hydrophobic 
interaction chromatography by applying it to a Phenyl-Sepharose CL-4B column 
equih'brated with 1.5M ammunium sulphate. The column was eluted with water, and 
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fractions containing GFP were identified by green fluorescence when illuminated with 
365nin UV light. To GFP containing fractions was added one volume of 20mM Tris, 
HQ (pH 7.5) and these were subjected to anion exchange chromatography by applying 
them to a Q-Sepharose column. The column was eluted with 20mM Tris, HCl (pH 7.5) 
5 + l.OM NaCl. GFP containing fractions were identified by green fluorescence when 
illuminated with 365nm UV light. GFP containing fractions were subjected to 
gelfiltration by applying them to a Superose-12 column equilibrated with lOOmM Na- 
phosphate buffer (pH 8.0). The column was eluted with lOOmM Na-phosphate buffer 
(pH 8.0) and fractions containing GFP were identified by green fluorescence when 
10 illuminated with 365nm UV light. The resulting GFP preparation was greater than 95% 
pure as judged by HPLC analysis. 

In vitro GFP phosphorylation assay 

For in vitro phosphorylation of GFP, 0.5/ig wild-type GFP (purified as described above) 
in 40mM Tris, pH 7.4, 20mM MgOAc and 0.2mM ATP (all from Sigma, St. Louis, MO, 

15 USA) was incubated for 1-60 minutes at 3TC with 0-20 casein units of the catalytic 
subunit of the cAMP dependent protein kinase (Promega, Madison, WI. USA) and 0- 
200/xM cAMP dependent protein kinase inhibitor. Emission (excitation wavelength 
395nm or 470nm) and excitation (emission wavelength 508nm) spectra were acquired 
for all samples using the L550B Luminescence Spectrometer and the FL data Manager 

20 software package (Perkin Elmer). The spectra were subsequently compared by using 
the Graph Builder software package (Perkin Elmer). 

As can be seen from Fig. 3, the fluorescence intensity of wild-type GFP increases 
approximately two-fold when incubated with the catalytic subunit of the cAMP 
dependent protein kinase. Moreover, Sfim cAMP dependent protein kinase inhibitor 
25 inhibits the effect of the catalytic subunit of the cAMP dependent protein kinase. 

Figure 3 shows emission spectra of O.SfMg wild-type GFP (purified as described in 
Example 6) in 40mM Tris, pH 7.4, 20mM MgOAc and 0.2mM ATP (all from Sigma), 
incubated for 5 minutes at 3TC with 10 casein units of the catalytic subunit of the 
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cAMP dependent protein kinase (Promega) with or without 5^M cAMP dependent 
protein kinase inhibitor (PKI). The control (w/o PKA) was incubated 5 minutes at 3TC 
without the catalytic subunit of the cAMP dependent protein kinase. The excitation 
wavelength was 395iun. RFI in the figure means relative fluorescence intensity. 

5 Example 7 

Characterization of wild-type GFP as a PKA activity probe in mammalian cells. 

The green fluorescent proteins exhibiting phosphorylation-induced changes in 
fluorescence are investigated further by expression in mammalian cells. 

Inserts of the respective plasmids are isolated by digestion with the restriction 
10 endonucleases Hindlll and BamHI and ligated into the Hindlll and BamHI sites of the 
MCS of the mammalian pZEO-SV vector (Invitrogen, San Diego, California, USA). 
Subsequently, Baby Hamster Kidney (BHK) are transfected with the resulting plasmid 
constructs according to the standard calcium phosphate-DNA precipitate protocol (cf. 
Sambrook et al., op. cit. at 16.33-16.35). Stable transfectants with high expression of the 
15 recombinant probes are identified and cloned after 6-14 days in culture by quantifying 
the fluorescence in an image analysis system, which consists of a Nikon Diaphot 200 
microscope with a temperature controlled stage, a slow scan CCD camera (T.I.L,L, 
Photonics), a polychromatic light source (T.I.L,L. Photonics), and a PC based image 
analysis software package (FTJCAL from T.LL.L. Photonics). Alternatively, the 
20 fluorescence properties are monitored in a photometer based system. In this system the 
CCD camera is replaced by a photomultiplier D104 (PTI, Canada). 

The clones are cultured for a couple of days in glass coverslip chambers (NUNC, 
Copenhagen, Denmark) before image analysis. 

The ability of the clones to detect changes in protein kinase A activity is characterized 
25 by elevating intracellular cAMP level by challenging the cells with forskolin (0.1-10 ^tM) 
or dibutyryl-cAMP (1-100 /iM) and monitoring the associated change of spectral 
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properties. The stimulant-induced changes of fluorescence properties are monitored 
continuously using above mentioned imaging system. Combining imaging with 
photometry makes it possible to characterize the response of the recombinant probes 
in both high spatial and high temporal resolution. 
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CLAIMS 

1. A method of detecting a biologically active substance affecting intracellular processes, 
the method comprising 

(a) culturing a cell containing a DNA sequence coding for 

5 (i) a green fluorescent protein (GFP) having a protein kinase recognition site, or 

(ii) a green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second 
messenger or an enzyme recognition site, or 

(iii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
10 and an enzyme recognition site or a binding domain for a second messenger 

under conditions permitting expression of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample suspected of containing a biologically active 
substance affecting intracellular processes, and 

15 (d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), such 
change being indicative of the presence of a biologically active substance in said sample. 

2. A method of detecting a biologically active substance affecting intracellular processes 
according to claim 1, the method comprising 

20 (a) culturing a cell containing a DNA sequence coding for 
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(i) green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 

(ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
5 and a binding domain of a second messenger or an enzyme recognition site 

under conditions permitting expression of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample suspected of containing a biologically active 
substance affecting intracellular processes, and 

10 (d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), such 
change being indicative of the presence of a biologically active substance in said sample. 

3. A method according to any one of the preceding claims, wherein the cell is a 
eukaryotic cell. 

15 4. A method according to any one of the preceding claims, wherein the cell is a yeast 
cell or a mammalian cell. 

5. A method according to any one of the preceding claims, wherein the binding domain 
is a receptor. 

6. A method according to any one of the preceding claims, wherein the binding domain 
20 is a cyclic AMP receptor or a part thereof capable of binding cyclic AMP. 

7. A method according to any one of the preceding claims, wherein the en2yme 
recognition site is a protein kinase recognition site. 
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8. A method according to any one of the preceding claims, wherein the protein kinase 
recognition site is selected from the group consisting of protein kinase A, protein kinase 
Q the insulin receptor, and the Src kinase. 

9. A method according to any one of the preceding claims, wherein the gene encoding 
5 GFP is derived from Aequorea victoria. 

10. A method of characterizing the biological activity of a substance with biological 
activity, the method comprising 

(a) culturing a cell containing a DNA sequence coding for 

(i) a green fluorescent protein having a protein kinase recognition site, or^ 

10 (ii) a green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second 
messenger or an enzyme recognition site, or 

(iii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site 

15 under conditions permitting expression of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample of a biologically active substance affecting 
intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
20 change in the fluorescence compared to the fluorescence measured in step (b), said 

change being characteristic of the biological activity of the biologically active substance 
in said sample. 
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11. A method of characterizing the biological activity of a substance with biological 
activity according to claim 10, the method comprising 

(a) culturing a cell containing a DNA sequence coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
5 substituted, inserted or deleted to provide a binding domain of a second messenger 

or an enzyme recognition site, or 

(ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
and a binding domain of a second messenger or an enzyme recognition site 

under conditions permitting expression of the DNA sequence, 

10 (b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample of a biologically active substance affecting 
intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), said 

15 change being characteristic of the biological activity of the biologically active substance 
in said sample. 

12. A method according to any one of claims 10 or 11, wherein the cell is a eukaryotic 
ceU. 

13. A method according to any one of claim 10, 11 or 12, wherein the cell is a yeast cell 
20 or a mammalian cell. 

14. A method according to any one of claims 10, 11, 12 or 13, wherein the binding 
domain is a receptor. 
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15. A method according to any one of claims 10, 11, 12, 13 or 14, wherein the binding 
domain is a cyclic AMP receptor or a part thereof capable of binding cyclic AMP. 

16. A method according to any one of claims 10, 11, 12, 13, 14 or 15 wherein the 
enzyme recognition site is a protein kinase recognition site, 

5 17. A method according to any one of claims 10, 11, 12, 13, 14, 15 or 16, wherein the 
protein kinase recognition site is selected from the group consisting of protein kinase 
A, protein kinase C, the insulin receptor, and the Src kinase. 

18. A method according to any one of claims 10, 11, 12, 13, 14, 15, 16, or 17 wherein 
the gene encoding GFP is derived from Aequorea victoria. 

10 19. A cell containing a DNA sequence coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 

(ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
15 and a binding domain of a second messenger or an enzyme recognition site, 

and capable of expressing said DNA sequence. 

20. A cell according to claim 19, which is a eukaiyotic cell. 

21, A cell according to any one of claims 19 or 20, which is yeast cell or a mammalian 
ceU. 

20 22. A cell according to any one of claims 19, 20 or 21 wherein the binding domain is 
a receptor. 
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23. A cell according to any one of claims 19, 20, 21 or 22, wherein the binding domain 
is a cyclic AMP receptor or a part thereof capable of binding cyclic AMP. 

24. A cell according to any one of claims 19, 20, 21, 22 or 23 wherein the enzyme 
recognition site is a protein kinase recognition site. 

5 25. A cell according to any one of claims 19, 20, 21, 22, 23 or 24, wherein the protein 
kinase recognition site is selected from the group consisting of protein kinase A, protein 
kinase C, the insulin receptor, and the Src kinase. 

26. A cell according to any one of claims 19, 20, 21, 23, 24 or 25 wherein the gene 
encoding GFP is derived from Aequorea victoria, 

10 27. A DNA construct comprising a DNA sequence coding for 

(i) green fluorescent protein wherein one or more amino acids have been 
substituted, inserted or deleted to provide a binding domain of a second messenger 
or an enzyme recognition site, or 

(ii) a hybrid polypeptide of green fluorescent protein (GFP) or a modified GFP 
15 and a binding domain of a second messenger or an enzyme recognition site. 

28. A DNA construct according to claim 27, wherein the binding domain is a receptor. 

29. A cell according to any one of claims 27 or 28, wherein the binding domain is a 
cyclic AMP receptor or a part thereof capable of binding cyclic AMP. 

30. A DNA construct according to any one of claims 27, 28 or 29 wherein the enzyme 
20 recognition site is a protein kinase recognition site. 
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31. A DNA construct according to any one of claims 27, 28, 29 or 30, wherein the 
protein kinase recognition site is selected from the group consisting of protein kinase 
A, protein kinase C, the insulin receptor, and the Src kinase. 

32. A DNA construct according to any one of claims 27, 28, 29, 30 or 31 wherein the 
5 gene encoding GFP is derived from Aequorea victoria, 

33. A method of detecting a biologically active substance affecting intracellular 
processes according to claim 1, the method comprising 

(a) culturing a cell containing a DNA sequence coding for wild-type green fluorescent 
protein (GFP) having a protein kinase recognition site under conditions permitting 

10 expression of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample suspected of containing a biologically active 
substance affecting intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
15 change in the fluorescence compared to the fluorescence measured in step (b), such 

change being indicative of the presence of a biologically active substance in said sample. 

34. A method according to the preceding claim, wherein the cell is a eukaryotic cell. 

35. A method according to any one of claims 33 or 34, wherein the cell is a yeast cell 
or a mammalian cell. 

20 36. A method according to any one of claims 33, 34 or 35 wherein the protein kinase 
recognition site is selected from the group consisting of protein kinase A, protein kinase 
C, the insulin receptor, and the Src kinase. 
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37. A method according to any one of claims 33, 34, 35 or 36 wherein the protein 
kinase recognition site is a protein kinase A recognition site. 

38. A method according to any one of claims 33, 34, 35, 36 or 37, wherein the DNA 
sequence encoding GFP is derived from Aequorea victoria. 

5 39. A method according to any one of claims 33, 34, 35, 36, 37 or 38, wherein the DNA 
sequence is identical to the nucleotide sequence of Fig. 4a or any functional analogue 
thereof. 

40. A method of characterizing the biological activity of a substance with biological 
activity according to claim 10, the method comprising 

10 (a) culturing a cell containing a DNA sequence coding for a wild-type green fluorescent 
protein having a protein kinase recognition site under conditions permitting expression 
of the DNA sequence, 

(b) measuring the fluorescence of the cell, 

(c) incubating the cell with a sample of a biologically active substance affecting 
15 intracellular processes, and 

(d) measuring the fluorescence produced by the incubated cell and determining any 
change in the fluorescence compared to the fluorescence measured in step (b), said 
change being characteristic of the biological activity of the biologically active substance 
in said sample. 

20 41. A method according to claim 40, wherein the cell is a eukaryotic cell. 

42. A method according to any one of claims 40 or 41, wherein the cell is a yeast cell 
or a mammalian cell. 
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43. A method according to any one of claims 40, 41 or 42, wherein the protein kinase 
recognition site is a protein kinase A recognition site. 

44, A method according to any one of claims 40, 41, 42 or 43, wherein the DNA 
sequence encoding GFP is derived from Aequorea victoria. 

5 45. A method according to any one of claims 40, 41, 42, 43 or 44, wherein the DNA 
sequence is the nucleotide sequence of Fig. 4a or any functional analogue thereof. 

46. A DNA construct containing the nucleotide sequence of Fig. 4a. 

47. A transformation vector containing the nucleotide sequence of Fig. 4a^ 

48. A transformed cell containing the DNA construct of claim 46. 

10 49. A transformed cell according to the preceding claim, characterised by being a 
mammalian cell* 

50. The transformed E. coli having the deposition No. DSM 10260. 

51. An in vitro assay for measuring protein kinase A activity, wherein purified wild-type 
GFP is added to a biological sample, preferably a cell extract, and any change in 

15 fluorescence of the GFP is recorded. 

52* An in vitro assay for monitoring cAMP levels in a sample, wherein wild-type GFP 
and tetrameric PKA having two regulatory and two catalytic subunits are added to a 
sample, preferably a cell extract, and any change in fluorescence is recorded. 
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Fig. 2 
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Nucleotide sequence (7 64 bp) of GFP 
(Hind3-EcoRl fragment) 

AAGCTTTATGAGTAAAGGAGAAGAACTTTTCACrGGAGTT 

GrCCCAATTCTTGTTGAATTAGATGGCGATGTTAATGGGC 

AAAAATTCTCItjTTAGTGGAGAGGGTGAAGGTGATX^CAAC 

ATACGGAAAACTTACCCTTAAATTTATTTGCACTACTGGG 

AAGCTACCTGTTCCATGGCCAACGCTTGTCACTACrTTCT 

CnTATGGTCTTCAATCXTTTTTCAAGATACCC^ 

GAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGCT 

TATGTACAGGAAAGAACTATATTTTACAAAGATGACGGGA 

ACTACAAGACACGTGCTGAAGTCVVAGTITGAAGGTX^ 

CCTTGTTAATAGAATCGAGTTAAAAGGTATTGATriTAAA 

GAAGATGGAAACATTCTTGGACACAAAATGGAATACAACr 

ATAACrCACATAATGTATACATCATGGCAGACAAACCAAA 

GAATGGCATCAAAGTTAACTTCAAAATrAGACACAACATr 

AAAGATGGAAGCGrTCAATTAGCAGACCATTATCA\CAAA 

ATACTCCAATTGGCGATGGCCCTGrCCri'riACCAGACAA 

CCATTACCrGTCCACGCAATCTGCCCrTTCCAAAGATCCC 

AAOGAAAAGAGAGATCACATGATCCTTCTTGAGTTTGTAA 

CAGCIOTTXjGGATTACACATGGCATCKI^TGAACTATA 

ATAAATGTCCAGACTrCCAATTGACACTAAAGGGATCCGA 

ATTC 

Fig. 4a 
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Amino acid sequence: 

Stajrt codon ATG corresponds to position 8 in the nucleotide sequence 
above and stop codon TAA corresponds to position 722, 



1/1 

atg agt 
Met ser 



61/21 
gat gtt 
asp 



va 1 



aaa 
lys 



aa t 
asn 



gaa gaa 
glu glu 



ct t 
leu 



181/61 

aaa ctt acc 

lys 4cu ihr 



ggg 



c 1 1 
1 eu 



caa 
gin 



aaa 
lys 



aaa 
lys 



1 1 1 
phe 



tic 
phe 



at t 
i le 



1 1 c 
phe 



t c t 
ser 



tgc 
cy s 



ac c 
ihr 



gtt 
va 1 



ac t 
Ihr 



gga 

giy 



agt 
ser 



act 
Ihr 



31/11 
gtt 
va 1 



gtc 
va 1 



91/31 
gga gag 
gly glu 

211/71 
ggg aag 
gly lys 



cca 
pro 



ggt 

gly 



c ta 
1 eu 



a 1 1 
i le 



gaa 
glu 



cc t 
pro 



c 1 1 
leu 



ggt 

gly 



gtt 
va 1 



gtt 
va 1 



gat 
asp 



cca 
pro 



gaa 
glu 



gca 
a 1 a 



tgg 
trp 



t ta 
i eu 



aca 
thr 



cca 
pro 



gat 
asp 



t ac 
tyr 



acg 
thr 



241/81 
cat gac 
his asp 



1 1 1 
phe 



301/101 
aaa gat gac 
lys asp asp 

361/121 
aat aga ate 
asn a rg i 1 e 

421/141 
atg gaa tac 
nie t glu tyr 



1 1 c 
phe 



ggg 

gly 



gag 
glu 



aac 
asn 



aag 
lys 



aac 
asn 



1 1 a 
I eu 



tat 
tyr 



agt 
ser 



tac 
tyr 



aaa 
lys 



aac 
asn 



gcc 
ala 



atg 
me t 



aag 
lys 



ggt 

gly 



tea 
ser 



aca 
thr 



at t 
i le 



ca t 
h i s 



ccc 
pro 



eg t 
arg 



gat 
asp 



aa t 
asn 



gaa 
glu 



get 
a 1 a 



1 1 1 
phe 



gta 
va 1 



271/91 
ggt tat 
gly tyr 

331/111 
gaa gtc 
glu 



va I 



391/131 
aaa gaa 
lys glu 

451/151 
tac ate 
tyr i le 



gta 
va I 



aag 
lys 



gat 
asp 



atg 
met 



cag 
gin 



1 1 1 
phe 



gga 

gly 



gca 
a 1 a 



gaa 
glu 



gaa 
glu 



aac 
asn 



gac 
asp 



aga 
arg 



ggt 

gly 



at t 
i le 



aaa 
lys 



act 
thr 



gat 
asp 



ctt 
1 eu 



cca 
pro 



a t a 
lie 



acc 
thr 



gga 

gly 



aag 
lys 



ggc 

gly 



gga 

gly 



c t I 
1 eu 



121/41 151/51 

gtc act act ttc tet tat ggt gtt caa tgc ttt tea aga tac cca gat eat atg aaa cag 
val thr thr phe ser tyr gly va I gin cys phe ser arg tyr pro asp his met lys gin 



1 1 1 
phe 



c 1 1 
I eu 



cac 
hi s 



aa t 
asn 



tac 
tyr 



gtt 
va 1 



aaa 
lys 



ggc 

gly 



481/161 511/171 

ate aaa gtt aac ttc aaa att aga cac aac att aaa gat gga age gtt caa tta gca gac 

ilc lys val asn phe lys ile arg his asn ile lys asp gly ser val gin leu ala asp 

541/181 571/191 

cat tat caa caa aat act cca att ggc gat ggc cet gtc ctt tta cca gac aac cat tac 

his tyr gin gin asn thr pro ile gly asp gly pro val leu leu pro asp asn his tyr 



601/201 631/211 

etg tec acg caa tci gcc ctt tec aaa gat ccc aac gaa aag aga gat cac atg ate 
leu ser thr gin ser ala leu ser lys asp pro asn glu lys arg asp his met ile 

661/221 691/231 

ctt gag ttt gta aca get get ggg att aca cat ^ge atg gat gaa eta tac aaa taa 
leu glu phe val thr ala ala gly ile thr his gly met asp glu leu tyr lys CXH 



c 1 1 
I eu 



Fig. 4b 
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